FASTQSUM

Basecall summary

Note: In computational biology, N50 is a statistic computed over a set of contig or scaffold lengths. It is the length L such that sequences of length L or longer contain at least half of the total bases. N50 is similar to a mean or median of lengths, but gives greater weight to the longer sequences. It is widely used in genome assembly, especially for contig lengths within a draft assembly.
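As a minimal sketch of how N50 is computed (the function name `n50` and the sample lengths are illustrative and not taken from the report), the lengths are sorted from longest to shortest and summed until the running total reaches half of all bases:

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    contain at least half of the total number of bases.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


if __name__ == "__main__":
    # Hypothetical lengths, for illustration only.
    example = [100, 200, 300, 400, 500]
    print(n50(example))
```

For these example lengths the mean and median are both 300, while the N50 is 400, which shows the extra weight given to longer sequences.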

Basecalled reads length

Basecalled reads length for all barcodes

Blue line: Median

Red line: Mean

Explanation: The basecalled reads length plot is a distribution plot (distplot) showing read density on the y-axis against basecalled read length, on a logarithmic scale, on the x-axis for all barcodes in the FASTQ file.

Note:
- A distplot (distribution plot) shows how the values in a data set are distributed.
- A logarithmic scale (log scale) displays numerical data spanning a very wide range of values in a compact way.

Basecalled reads length for each barcode

Explanation: The basecalled reads length plot is a histogram plot (histplot) showing the number of reads on the y-axis against basecalled read length, on a logarithmic scale, on the x-axis for each barcode in the FASTQ file.

Note:
- A histplot (histogram plot) groups numerical data into bins and shows how many observations fall into each bin.
- A logarithmic scale (log scale) displays numerical data spanning a very wide range of values in a compact way.
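The idea of a histogram on a logarithmic x-axis can be sketched as follows. This is not the report's plotting code; the function name `log_binned_counts` and the `bins_per_decade` parameter are assumptions made for illustration. Each read length is assigned to a bin whose edges are evenly spaced in log10 space, so short and very long reads fit on the same axis:

```python
import math
from collections import Counter


def log_binned_counts(read_lengths, bins_per_decade=4):
    """Count reads in bins that are evenly spaced on a log10 scale.

    Returns a list of (lower_edge, upper_edge, count) tuples,
    ordered from the shortest to the longest bin.
    """
    counts = Counter()
    for length in read_lengths:
        if length <= 0:
            continue  # log10 is undefined for non-positive lengths
        counts[math.floor(math.log10(length) * bins_per_decade)] += 1
    return [
        (10 ** (i / bins_per_decade), 10 ** ((i + 1) / bins_per_decade), counts[i])
        for i in sorted(counts)
    ]


if __name__ == "__main__":
    # Hypothetical read lengths in bases, for illustration only.
    lengths = [150, 200, 1500, 2500, 3000, 20000]
    for lower, upper, count in log_binned_counts(lengths, bins_per_decade=1):
        print(f"{lower:>8.0f} to {upper:>8.0f}: {count}")
```

With one bin per decade, each bin spans a factor of ten in length; raising `bins_per_decade` gives a finer histogram.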

Quality score summary

Explanation: The quality score summary table shows descriptive statistics for each barcode arrangement.
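A per-barcode summary of this kind could be produced along the following lines. The choice of statistics (count, minimum, maximum, mean, median), the function name `summarize_by_barcode`, and the input layout of (barcode, mean quality score) pairs are assumptions for illustration; the report's actual table may contain different columns:

```python
from collections import defaultdict
from statistics import mean, median


def summarize_by_barcode(records):
    """Group (barcode, mean_quality) pairs and describe each group."""
    grouped = defaultdict(list)
    for barcode, score in records:
        grouped[barcode].append(score)
    return {
        barcode: {
            "count": len(scores),
            "min": min(scores),
            "max": max(scores),
            "mean": mean(scores),
            "median": median(scores),
        }
        for barcode, scores in sorted(grouped.items())
    }


if __name__ == "__main__":
    # Hypothetical reads, for illustration only.
    reads = [
        ("barcode01", 8.0),
        ("barcode01", 10.0),
        ("barcode02", 12.0),
    ]
    for barcode, stats in summarize_by_barcode(reads).items():
        print(barcode, stats)
```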

Basecalled reads PHRED quality

Red line: Suggested cut-off (mean quality score of 8.0)

Explanation: The basecalled reads PHRED quality plot shows the frequency distribution of the mean quality score for each barcode arrangement.
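To interpret the values on this plot, recall that a PHRED quality score Q is related to the estimated base-calling error probability P by Q = -10 * log10(P). The sketch below converts in both directions; the function names are illustrative and are not part of the report:

```python
import math


def phred_to_error_probability(q):
    """Convert a PHRED quality score to an error probability."""
    return 10 ** (-q / 10)


def error_probability_to_phred(p):
    """Convert an error probability (0 < p <= 1) to a PHRED score."""
    if not 0 < p <= 1:
        raise ValueError("p must be in the interval (0, 1]")
    return -10 * math.log10(p)


if __name__ == "__main__":
    for q in (8.0, 10.0, 20.0):
        print(f"Q{q:g}: error probability {phred_to_error_probability(q):.4f}")
```

Under this relation, a quality score of 8.0 corresponds to an error probability of about 0.158, that is, roughly one erroneous base in six.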

Number of reads per quality score

Explanation: The number of reads per quality score plot shows the relative numbers of passed and failed reads.
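A pass/fail split like this can be sketched as below. The sketch assumes that a read passes when its mean quality score is greater than or equal to a cut-off, and uses 8.0 as the default only because that is the suggested cut-off shown on the PHRED quality plot; the exact rule used to produce the report is not stated here, and the function name `pass_fail_counts` is hypothetical:

```python
def pass_fail_counts(mean_quality_scores, cutoff=8.0):
    """Count reads at or above the cut-off (passed) and below it (failed).

    Assumption: "passed" means mean quality >= cutoff.
    """
    passed = sum(1 for score in mean_quality_scores if score >= cutoff)
    failed = len(mean_quality_scores) - passed
    total = passed + failed
    fraction_passed = passed / total if total else 0.0
    return {"passed": passed, "failed": failed, "fraction_passed": fraction_passed}


if __name__ == "__main__":
    # Hypothetical mean quality scores, for illustration only.
    print(pass_fail_counts([7.5, 8.0, 9.2, 6.1]))
```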

Length VS Score summary